Communicative speech synthesis with XIMERA: a first step

نویسندگان

  • Shinsuke Sakai
  • Jinfu Ni
  • Ranniery Maia
  • Keiichi Tokuda
  • Minoru Tsuzaki
  • Tomoki Toda
  • Hisashi Kawai
  • Satoshi Nakamura
چکیده

This paper presents a corpus-based approach to communicative speech synthesis. We chose “good news” style and “bad news” style for our initial attempt to synthesize speech that has appropriate expressiveness desired in human-human or human-machine dialog. We utilized 10-hour “neutral” style speech corpus as well as smaller corpora with good news and bad news styles, each consisting of two to three hours of speech from the same speaker. We trained target HMM models with each style and synthesized speech with unit databases containing speech with the relevant style as well as neutral speech. From the listening tests, we found out that intended communicative styles were comprehended by listeners and that considerably high mean opinion score on naturalness was achieved with rather small, style-specific corpora.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

XIMERA: a new TTS from ATR based on corpus-based technologies

This paper describes a new concatenative TTS system under development at ATR. The system, named XIMERA, is based on corpus-based technologies, as was the case for the preceding TTS systems from ATR, namely ν-talk and CHATR. The prominent features of XIMERA are (1) large corpora (a 110hours corpus of a Japanese male, a 60-hours corpus of a Japanese female, and a 20-hours corpus of a Chinese fema...

متن کامل

Listening-Test-Based Annotation of Communicative Functions for Expressive Speech Synthesis

This paper is focused on the evaluation of listening test that was realized with a view to objectively annotate expressive speech recordings and further develop a limited domain expressive speech synthesis system. There are two main issues to face in this task. The first matter in issue to be taken into consideration is the fact that expressivity in speech has to be defined in some way. The sec...

متن کامل

Fiction in the Context of Developing Students' Professional and Communicative Competencies (in the Field of Hospitality)

The article discusses the methodological potential of fiction in developing professional and communicative competencies of Hospitality students based on interdisciplinary approach. The study focuses on the most actual aspects of speech culture and describes the ways of developing professionally oriented communicative competencies and the basics of professional speech training of bachelors in th...

متن کامل

Expressive Speech Synthesis for Czech Limited Domain Dialogue System – Basic Experiments

This paper describes a development of limited domain expressive speech synthesis for the Czech language. Our current speech synthesis system is based on unit selection methods and produces high quality speech in a neutral speaking style. This work focuses on modifications made in the synthesis algorithm to integrate expressivity into generated speech. There is also introduced a listening test, ...

متن کامل

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007